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Abstract 

The Database ofVascular Plants of Canada or VASCAN (http://data.canadensys.net/vascan) is a comprehen- 
sive and curated checklist of all vascular plants reported in Canada, Greenland (Denmark), and Saint Pierre 
and Miquelon (France). VASCAN was developed at the Universite de Montreal Biodiversity Centre and is 
maintained by a group of editors and contributors. For every core taxon in the checklist (species, subspecies, or 
variety), VASCAN provides the accepted scientific name, the accepted French and English vernacular names, 
and their synonyms/alternatives in Canada, as well as the distribution status (native, introduced, ephemeral, 
excluded, extirpated, doubtful or absent) of the plant for each province or territory, and the habit (tree, shrub, 
herb and/or vine) of the plant in Canada. For reported hybrids (nothotaxa or hybrid formulas) VASCAN also 
provides the hybrid parents, except if the parents of the hybrid do not occur in Canada. All taxa are linked to 
a classification. VASCAN refers to a source for all name, classification and distribution information. 



* see Appendix. 

The article is also available on GitHub at https://github.com/peterdesmet/vascan-data-paper 



Copyright Peter Desmet, Luc Brouillet This is an open access article distributed under the terms of the Creative Commons Attribution License 3.0 
(CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 
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All data have been released to the public domain under a CCO waiver and are available through Canaden- 
sys and the Global Biodiversity Information Facility (GBIF) . VASCAN is a service to the scientific commu- 
nity and the general public, including administrations, companies, and non-governmental organizations. 

Keywords 

VASCAN, Canadensys, Canada, Greenland, Saint Pierre and Miquelon, checklist, taxonomy, synonymy, 
hybrids, vernacular names, English, French, distribution, provinces, habit, open data 



Data published through 

The Canadensys repository: doi: 1 0.5886/ lbft7W5f 



Project details 

Project title 

Database of Vascular Plants of Canada (VASCAN) 



Personnel 

Data compilation editors: Luc Brouillet (Universite de Montreal Biodiversity Cen- 
tre): coordination, taxonomic and geographic compilation, Frederic Coursol (Montreal 
Botanical Garden): taxonomic and geographic compilation, Susan Meades (Northern 
Ontario Plant Database): taxonomic compilation, Marc Favreau (Translation Bureau, 
Public Works and Government Services Canada): French vernacular names compila- 
tion, Marilyn Anions (botanist, Ottawa): English vernacular names compilation. 

Development: Peter Desmet (Universite de Montreal Biodiversity Centre): coor- 
dination and web design, Pierre Belisle (Universite de Montreal Biodiversity Centre): 
development, Christian Gendreau (Universite de Montreal Biodiversity Centre): de- 
velopment and maintenance, David Shorthouse (Universite de Montreal Biodiversity 
Centre): coordination, Patrick O'Reilley (Universite de Montreal): initial data import. 



Funding 

Partial funding came from Parks Canada, the Canadian Biodiversity Information Fa- 
cility (CBIF), NatureServe Canada, the Canadian Foundation for Innovation (CFI), 
and the Gouvernement du Quebec (grant to the Universite de Montreal Biodiversity 
Centre and Canadensys). Most of the compilation work, however, was contributed in 
kind by the home institution of each collaborator. 
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Study area description 

The study area occupies the northern half of North America (excluding Alaska). The area 
of Canada is 9,984,670 km 2 , of Greenland (or Kulaalit Nunaat, an autonomous country 
within the kingdom of Denmark) 2,166,086 km 2 , and Saint Pierre and Miquelon (col- 
lectivite territoriale, France) 242 km 2 . The latter is 20 km off the coast of Newfound- 
land's Burin Peninsula and its characteristics are those of boreal eastern Canada. From 
west to east, the main physiographic regions are the Western Cordillera, the sedimentary 
Interior Plains, the Canadian and Greenland Shields (mostly igneous rocks), the sedi- 
mentary Great Lakes and St. Lawrences Lowlands, and the Appalachian Mountains. The 
sedimentary Hudson Bay Lowlands basin lies at the centre of the shield, a northern area 
of sedimentary plains and mountains. The Canadian Arctic borders the Arctic Ocean in 
northern Canada and northern Greenland. An ice cap covers 8 1 % of Greenland. 

The dominant vegetation type of the area is the boreal forest, which occupies much 
of Canada from Yukon and northeastern British Columbia to Newfoundland. To the 
north, Arctic tundra prevails: it can be divided into low Arctic (with a nearly continuous 
plant cover, sometimes shrubby) and high Arctic (including polar deserts); these types are 
the only ones found in Greenland. To the south of the boreal forest, from west to east, are 
the humid Pacific Coastal forest, the Cordilleran forest, the Prairie grasslands, the eastern 
temperate forests (southern Ontario and Quebec), and the Atlantic or Acadian forests. 

The population of Canada is concentrated in a narrow belt along its border with 
the United States, where most of the impacts on ecosystems (urbanization, agriculture) 
is concentrated. Logging, mining, and hydroelectric development occur in the boreal 
forest, and mining is now rapidly developing in the Arctic. About 9.9% (Environ- 
ment Canada 201 1) of the terrestrial area of Canada is protected (7.5% according to 
the World Bank 2013) and 40% of Greenland. Based on the data in VASCAN, the 
area harbors a total of 5,124 vascular plant species, 3,829 native and 1,295 introduced 
(25% of the flora). Of the native species of Canada, 156 are considered legally at risk, 
with a further 34 of conservation concern (COSEWIC 2009+) . 



Design description 

The goal of the Database of Vascular Plants of Canada (VASCAN) is to provide an up- 
to-date, documented checklist of the names of vascular plants in Canada, Greenland, 
and Saint Pierre and Miquelon, both scientific and vernacular, and the distribution of 
the plants at the provincial/territorial level. 

VASCAN was developed from the need to validate vascular plant name and distri- 
bution data from eastern Canada (Ontario and eastward), Greenland, and Saint Pierre 
and Miquelon for the Flora of North America project (FNA) and from the need to 
provide French vernacular names for taxa present in Quebec in the FNA. It expanded 
when Parks Canada wanted to harmonize the names from vascular plant species lists 
of its parks across the country. At the time we also realized that - aside from The 
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Flora of Canada by Scoggan (1978-1979) that was in need of updating - not only was 
there no standardized scientific name list for the country - despite worthwhile efforts 
from Kartesz (1999) and USDA NRCS (2011) - but also no standardized source of 
Canadian English and French vernacular names. Names used for plants in English 
Canada are not necessarily those used in the United States, and thus U.S. sources were 
not always appropriate for this goal. Finally, several national organizations, such as 
Parks Canada, Forest Canada, the Committee on the Status of Endangered Wildlife in 
Canada (COSEWIC), and NatureServe Canada, expressed the need for a web-based 
list of Canadian taxa, with data on provincial/territorial distribution. 



Taxonomic coverage 

This checklist covers all vascular plants (Equisetopsida, Tracheophytd) reported in the 
area described in the section 'Spatial Coverage' (Figure 1). The core taxa considered are 
species, subspecies or varieties, and their hybrids. For these taxa, we provide synonyms, 
the accepted and alternative French and English vernacular names, and the habit (tree, 
shrub, herb and/or vine) of the plant in Canada. For reported hybrids (nothotaxa or 
hybrid formulas) we also indicate the hybrid parents, except if the parents of the hy- 
brid do not occur in Canada. This core information is not provided for higher taxa, 
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Figure I. Taxonomic distribution of accepted species per family from the Database of Vascular Plants 
of Canada (VASCAN). The families are ordered by total number of species. Families with less than 80 
species are grouped in 'Other families'. 
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although the calculated distribution based on lower taxa can be consulted and down- 
loaded from the VASCAN website (http://data.canadensys.net/vascan). 

All taxa are linked to a classification: Chase and Reveal (2009) for the higher clas- 
sification, Christenhusz et al. (2011a) for lycophytes, Smith et al. (2006) for monilo- 
phytes (modified in Rothfells et al. 2012), Christenhusz et al. (2011b) for the gym- 
nosperms, and the Angiosperm Phylogeny Group (2009) for flowering plants. At the 
generic level and below, the Flora of North America Editorial Committee (1993+) is 
the main source of classification, unless taxonomic literature more recent than the vol- 
ume published for a given taxon provides a taxonomy more reflective of current data. 
The source used is indicated for each taxon in the dataset. 

The classification includes 16 ranks. They are, in hierarchical order: class, subclass, 
superorder, order, family, subfamily, tribe, subtribe, genus, subgenus, section, subsec- 
tion, series, species, subspecies and variety. Varieties within subspecies are accepted, so 
quadrinomial names are present, but forms are not included. 



Taxonomic ranks 

Kingdom: Plantae 

Class: Equisetopsida 

Family: Acanthaceae, Acoraceae, Adoxaceae, Alismataceae, Amaranth aceae, Amaryllidace- 
ae, Anacardiaceae, Annonaceae, Apiaceae, Apocynaceae, Aquifoliaceae, Araceae, Araliaceae, 
Aristolochiaceae, Asparagaceae, Aspleniaceae, Asteraceae, Athyriaceae, Balsaminaceae, Ber- 
beridaceae, Betulaceae, Bignoniaceae, Blechnaceae, Boraginaceae, Brassicaceae, Butomaceae, 
Buxaceae, Cabombaceae, Cactaceae, Campanulaceae, Cannabaceae, Caprifoliaceae, Caryo- 
phyllaceae, Celastraceae, Ceratophyllaceae, Cistaceae, Cleomaceae, Clethraceae, Colchicaceae, 
Commelinaceae, Convolvulaceae, Cornaceae, Crassulaceae, Cucurbitaceae, Cupressaceae, 
Cyperaceae, Cystopteridaceae, Dennstaedtiaceae, Diapensiaceae, Dioscoreaceae, Droseraceae, 
Dryopteridaceae, Elaeagnaceae, Elatinaceae, Equisetaceae, Ericaceae, Eriocaulaceae, Eu- 
phorbiaceae, Fabaceae, Fagaceae, Frankeniaceae, Gentianaceae, Geraniaceae, Grossulari- 
aceae, Haemodoraceae, Haloragaceae, Hamamelidaceae, Hydrangeaceae, Hydrocharitaceae, 
Hymenophyllaceae, Hypericaceae, Hypoxidaceae, Iridaceae, hoetaceae, Juglandaceae, Jun- 
caceae, Juncaginaceae, Lamiaceae, Lauraceae, Lentibulariaceae, Liliaceae, Limnanth aceae, 
Linaceae, Linderniaceae, Loasaceae, Loranthaceae, Lycopodiaceae, Lythraceae, Magnoli- 
aceae, Malvaceae, Marsileaceae, Martyniaceae, Melanthiaceae, Melastomataceae, Men- 
ispermaceae, Menyanthaceae, Molluginaceae, Montiaceae, Moraceae, Myricaceae, Nartheci- 
aceae, Nelumbonaceae, Nyctaginaceae, Nymphaeaceae, Oleaceae, Onagraceae, Onocleaceae, 
Ophiogbssaceae, Orchidaceae, Orobanchaceae, Osmundaceae, Oxalidaceae, Paeoniaceae, 
Papaveraceae, Penthoraceae, Phrymaceae, Phytolaccaceae, Pinaceae, Plantaginaceae, Platan- 
aceae, Plumbaginaceae, Poaceae, Podostemaceae, Polemoniaceae, Polygalaceae, Polygonaceae, 
Polypodiaceae, Pontederiaceae, Portulacaceae, Potamogetonaceae, Primulaceae, Pteridaceae, 
Ranunculaceae, Resedaceae, Rhamnaceae, Rosaceae, Rubiaceae, Ruppiaceae, Rutaceae, Sali- 
caceae, Salviniaceae, Santalaceae, Sapindaceae, Sarraceniaceae, Saururaceae, Saxifragaceae, 
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Scheuchzeriaceae, Schizaeaceae, Scrophulariaceae, Selaginellaceae, Simaroubaceae, Smi- 
lacaceae, Solanaceae, Staphyleaceae, Tamaricaceae, Taxaceae, Thelypteridaceae, Thymelae- 
aceae, Tofieldiaceae, Typhaceae, Ulmaceae, Urticaceae, Verbenaceae, Violaceae, Vitaceae, 
Woodsiaceae, Xanthorrhoeaceae, Xyridaceae, Zostemceae, Zygophyllaceae. 



Common names 

Vascular plants, Lycopods, ferns, conifers, flowering plants. In the dataset, French and 
English vernacular names are provided for families, species, subspecies, and varieties. 



Spatial coverage 

The checklist covers all vascular plants reported in Canada, Greenland (Denmark), and 
Saint Pierre and Miquelon (France) (Figure 2). The latter two regions are added because 
their floras are intimately related to that of Canada and it is useful for Canadians and others 
to know about them. Provincial distributions are provided to help Canadians visualize the 
relationship among the floras of their provinces and territories. VASCAN does not intend to 
replace regional or provincial lists but to act as a complement to them. The covered regions 
are, in alphabetical order: Alberta, British Columbia, Greenland, Labrador, Manitoba, New 
Brunswick, Newfoundland, Northwest Territories, Nova Scotia, Nunavut, Ontario, Prince 
Edward Island, Quebec, Saint Pierre and Miquelon, Saskatchewan, and Yukon. 
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Figure 2. Regional distribution of accepted species from the Database of Vascular Plants of Canada 
(VASCAN). For each region, the number of native, introduced and ephemeral species is shown, i.e. spe- 
cies with a confirmed presence in the region. The regions are ordered by total number of species. 
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The distribution status of the plant is indicated per region. These can be grouped 
as present (native, introduced or ephemeral), previously reported but currently consid- 
ered absent (excluded, extirpated), doubtful or not reported (absent). The latter status 
is not recorded in the database (null value). Excluded taxa are those considered not 
currently occurring in a region, due either to non-recurring ephemeralness, misidenti- 
fication, lack of supporting documentation, or when specimens are old and the taxon 
has not been observed again in more than 50 years. All distribution statuses are defined 
at http://data.canadensys.net/vascan/about/#distribution. 

The VASCAN website (http://data.canadensys.net/vascan) provides a distribution 
map for each taxon. For higher taxa, these are calculated based on lower taxa, with 
the distribution statuses ordered as such: native, introduced, ephemeral, excluded, ex- 
tirpated, doubtful, absent. E.g., if two species within the same genus are respectively 
native and doubtful in a certain region, the genus is considered native for that region. 

The website also provides a checklist builder (http://data.canadensys.net/vascan/ 
checklist), where users can generate their own list of taxa based on several criteria (tax- 
onomy, region, distribution status, or a combination of these) and download this as a 
Darwin Core Archive or text file. 



Bounding box for covered area 

4l°40'N and 83°40'N latitude; 14 TOO 'W and 1 1° 19 'W longitude 



Temporal coverage 

17th to 21st century. 



Sampling methods 

Study extent description 

See the section 'Spatial coverage' and 'Project details - Study area description'. 



Sampling description 

The data are sampled manually from literature by the editors, though recent additions 
are based on specimens maintained at institutional herbaria across Canada (see Thiers). 

All floras covering Canada, Greenland, and Saint Pierre and Miquelon were con- 
sidered for literature-based data entry, but only the most recent provincial and ter- 
ritorial floras (see the section 'References - References used to build the dataset') were 
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systematically searched to establish the distribution status of each taxon in each region 
(see the section 'Spatial coverage'). Scoggan's Flora of Canada (1978-1979) was sys- 
tematically searched, as were Kartesz ( 1 999) and the Flora of North America (FNA Ed. 
Comm. 1993+). English and French vernacular names are based on usage in Canada 
and, for introduced taxa, on vernaculars from the countries of origin (when the taxon 
is from Europe). Alternate (synonym) vernaculars are provided when several names 
are in usage (notably regional names), but an accepted vernacular is recommended for 
general usage throughout the country. The method of selection of vernacular names 
follows Darbyshire et al. (2000). The source of the information is referenced for all 
scientific names, vernacular names and distributions in the dataset. 



Quality control description 

New findings or corrections for plant distributions are communicated to the editors by 
contributors from each region (Appendix). Contributors are local botanists, often as- 
sociated with Canadian herbaria or Conservation Data Centers. All new reports must 
be documented by specimens deposited at institutional herbaria. 

Suggestions or corrections regarding names, taxonomy, or functionality of the VAS- 
CAN website are submitted by users and reviewers through a public Google Code is- 
sue tracker at http: //code. google. com/p/canadensys/issues/list?can=2&q=label:vascan. 
Name suggestions are validated by the editors against names in Tropicos (http://www. 
tropicos.org), IPNI (http://www.ipni.org), GRIN (http://www.ars-grin.gov), or other 
plant name databases, before being manually corrected in the database. 



Dataset 

The data are stored in a relational database (MySQL), which powers the search, check- 
list builder, taxon and name pages of the VASCAN website. Editors update a develop- 
ment copy of the database through a secure web application. This allows them to make 
revisions without affecting the users of the website. Once they agree that the data are 
consistent, in which they are aided by the application, they can push that version of 
the database to production. 

At that moment, the application will also automatically generate a Darwin Core 
Archive of the data, using the GBIF GNA Profile (GBIF 2010) and following best 
practices for publishing checklists (GBIF 2011). This archive (Figure 3) includes all 
data, except for calculated distributions, hybrid parents, and user credentials. The ar- 
chive is then manually uploaded to the Canadensys Repository (http://data.canaden- 
sys.net/ipt), a GBIF Integrated Publishing Toolkit, and republished, at which time 
it will be assigned a new version number (version 24 at the time of publication). The 
dataset is registered with the Global Biodiversity Information Facility (GBIF), which 
allows that organization to harvest, display and distribute the data at any time. 
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taxon.txt 

Scientific name, taxonomic 
status, classification, source 
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Region, distnbution status, 
source 
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Vernacular name, status, 
language, source 
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Figure 3. The VASCAN Darwin Core Archive, structured following the GBIF GNA Profile. It is a 
compressed folder containing 4 text files with tab-seperated values and 2 xml files. Taxon and scientific 
name information is provided in taxon. txt, with one record for each taxon and child-parent-relationships 
representing the classification. Records in the extension files distribution, txt, vernacularname. txt and de- 
scription, txt have a many- to-one relation with the records in taxon. txt and provide additional information 
for each taxon. The archive structure and term definitions are described in meta.xml. The dataset metadata 
are provided in eml.xml. 



To the extent possible under law, the Universite de Montreal Biodiversity Centre 
has waived all copyright and related or neighboring rights to this dataset, releasing it to 
the public domain under a CCO waiver. Users of the data are encouraged to follow the 
Canadensys norms for data use and publication (http://www.canadensys.net/norms): 

Give credit where credit is due: As is common practice in scientific research, cite the 
data you are using. 

Be responsible: Use the data responsibly. The data are published to allow anyone 
to better study and understand the world around us, so please do not use the data in 
any way that is unlawful, harmful or misleading. Understand that the data are subject 
to change, errors and sampling bias. Protect the reputation of the data publisher and 
clearly indicate any changes you may have made to the data. 

Share knowledge: Let us know if you have used the data. It helps us to showcase our 
efforts and it helps you reach a wider audience. Inform us if you have comments about 
the data, notice errors, or want more information. 

Respect the data license: Understand and respect the data waiver under which the 
data are published. To help you make greater use of the data, we have dedicated the 
data to the public domain (CCO). Do not remove the public domain mark or provide 
misleading information about the copyright status. 
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Object name: Darwin Core Archive for the Database of Vascular Plants of Canada 
(VASCAN) 

Character encoding: UTF-8 

Format name: Darwin Core Archive format 

Format version: 1.0 

Distribution: doi: 10.5886/Y7SMZY5P 

Publication date of data: 2013-07-22 

Language: English 

Licenses of use: http://creativecommons.Org/publicdomain/zero/l.0/ & http://www. 

canadensys .net/ norms 

Metadata language: English 

Date of metadata creation: 2013-07-22 

Hierarchy level: Dataset 

Suggested citation for the latest version of the dataset 

Brouillet L, Desmet P, Coursol F, Meades SJ, Favreau M, Anions M, Belisle P, Gend- 
reau C, Shorthouse D and contributors (2010+) Database of Vascular Plants of Cana- 
da (VASCAN). Online at http://data.canadensys.net/vascan, doi: 10.5886/lbft7W5f, 
and http://data.gbif.org/datasets/resource/13558, released on 2010-12-10. Version 
[xx]. GBIF key: 3f8al297-3259-4700-91fc-acc4l70b27ce. Data paper ID: http:// 
dx.doi.org/10.3897/phytokeys.25.3100 [accessed on [date]] 



External datasets 

Object name: GBIF data portal 
Character encoding: UTF-8 
Format name: various formats 

Distribution: http: / / data.gbif. org/ datasets/ resource/ 13558 
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Appendix 
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